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ABSTRACT 

Myotonic dystrophy type 1 (DM1) is a dominantly 
inherited neuromuscular disorder resulting from ex- 
pression of RNA containing an expanded CUG repeat 
(CUG exp ). The pathogenic RNA is retained in nuclear 
foci. Poly-(CUG) binding proteins in the Muscleblind- 
like (MBNL) family are sequestered in foci, caus- 
ing misregulated alternative splicing of specific pre- 
mRNAs. Inhibitors of MBNL1-CUG exp binding have 
been shown to restore splicing regulation and cor- 
rect phenotypes in DM1 models. We therefore con- 
ducted a high-throughput screen to identify novel 
inhibitors of MBNL1-(CUG) 12 binding. The most ac- 
tive compound was lomofungin, a natural antimi- 
crobial agent. We found that lomofungin undergoes 
spontaneous dimerization in DMSO, producing dilo- 
mofungin, whose inhibition of MBNL1-(CUG)i 2 bind- 
ing was 17-fold more potent than lomofungin itself. 
However, while dilomofungin displayed the desired 
binding characteristics in vitro, when applied to cells 
it produced a large increase of CUG exp RNA in nu- 
clear foci, owing to reduced turnover of the CUG exp 
transcript. By comparison, the monomer did not in- 
duce CUG exp accumulation in cells and was more 
effective at rescuing a CUG exp -induced splicing de- 
fect. These results support the feasibility of high- 
throughput screens to identify compounds target- 
ing toxic RNA, but also demonstrate that ligands for 
repetitive sequences may have unexpected effects 
on RNA decay. 



INTRODUCTION 

Myotonic dystrophy type 1 (DM1) is the prototypical ge- 
netic disease resulting from a toxic gain-of-function by mu- 
tant RNA (1). RNA toxicity has been implicated in several 
other disorders that share a distinctive set of genetic fea- 
tures: the causal mutations are large polymorphic expan- 
sions of simple tandem repeats that are genetically unsta- 
ble and located in non-coding regions of the genome (2-9). 
The underlying disease mechanisms are heterogeneous and 
postulated to include activation of signalling pathways by 
mutant RNA, aberrant translation that initiates within the 
repeat tract without an AUG start codon and sequestration 
of proteins that bind to long tracts of CUG, CCUG, GGC- 
CUG, GGGGCC, CGG, AUUCU or UGGAA repeats [re- 
cently reviewed by Krzyzosiak et al (10)]. 

In DM1 the expanded sequence is a (CTG) n repeat in 
the 3' untranslated region (UTR) of DM protein kinase 
(DMPK), a gene expressed in muscle, heart and brain (1 1). 
When pathologically expanded beyond 50 repeats, the CTG 
tract is highly unstable in germline and somatic cells, which 
leads to DMPK alleles having several thousand repeats (12). 
The DMPK mRNA containing an expanded CUG repeat 
(CUG exp ) is retained in the nucleus in foci (13,14). Splic- 
ing factors in the Muscleblind-like (MBNL) family, which 
are the predominant r(CUG) n binding proteins in mam- 
malian cells, are sequestered in the foci of CUG exp RNA 
(15,16). The resulting loss of MBNL function causes mis- 
regulated alternative splicing and other changes of the mus- 
cle transcriptome (17-19). For example, mis-splicing of in- 
sulin receptor and chloride ion channel 1 lead to insulin resis- 
tance and muscle hyperexcitability (myotonia), respectively 
(20,21). 

As yet there are no disease-modifying treatments for 
DM1 or other RNA dominant disorders. However, sev- 
eral compounds or oligonucleotides have shown benefi- 
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rial effects in cell or animal models (22-31). For small 
molecules the predominant approach has been to iden- 
tify, from the repertoire of known nucleic acid binders, a 
set of compounds that show preferential binding to CUG 
repeats (22,27-29,32-34). CUG-binding compounds have 
also been assembled using dynamic combinatorial libraries 
(24,26). These studies have indicated that small molecules 
can improve DM1 -related splicing defects by inhibiting 
MBNL-CUG exp binding, thus restoring MBNL function in 
cells. 

We performed a high-throughput screen to identify com- 
pounds that inhibit r(CUG) n binding to MBNL1, the pre- 
dominant MBNL protein of skeletal muscle. Out of 279 
433 compounds in the screen, the most potent inhibitor 
was lomofungin, a natural antimicrobial agent from Strep- 
tomyces lomondensis (35,36). We found that lomofungin 
undergoes spontaneous dimerization in dimethyl sulfoxide 
(DMSO), producing dilomofungin, whose potency was 17- 
fold greater than lomofungin in the same screen. However, 
while dilomofungin displayed greater r(CUG) n affinity and 
stronger MBNL- (CUG) 12 binding inhibition in vitro, it was 
the monomer that showed greater correction of MBNL1- 
dependent splicing in cells. This discrepancy between the 
cell-free and intracellular activities appeared to result from 
a novel effect of dilomofungin on RNA turnover. Dilomo- 
fungin reduced the rate of CUG exp decay in cells, which led 
to a striking increase of CUG exp RNA in nuclear foci. These 
results are cautionary for development of high affinity RNA 
ligands to mitigate RNA toxicity, which in some cases may 
stabilize the target and promote the accumulation of mutant 
RNA in cells. 

MATERIALS AND METHODS 

High-throughput screen for inhibitors of MBNL1-CUG 
RNA binding 

Our homogeneous time-resolved fluorescence energy trans- 
fer (HTRF) assay for inhibitors of MBNL 1 -(CUG) 12 bind- 
ing was previously described (37). In brief, the assay 
employed a chemically synthesized (CUG) 12 oligoribonu- 
cleotide and recombinant human MBNL1 protein. The 
(CUG) 12 oligonucleotide was biotinylated at the 5' end and 
contained a 4 bp GC clamp at 5' and 3' ends to stabilize 
the repeat in a hairpin conformation, with G-C and C-G 
base pairs separated by periodic U-U mismatch in the stem. 
The recombinant MBNL1 had a 105 amino acid deletion 
at the C-terminus, to reduce protein aggregation without 
loss of r(CUG) n binding, a GST tag at the N-terminus and 
a His 6 tag at the C-terminus. The MBNL1-A105-His 6 was 
expressed and purified from Escherichia coli as previously 
described (37). The biotin-(CUG)i 2 RNA and MBNL1- 
A105-His6 protein were mixed in equimolar concentrations 
(20 nM each) and dispensed in 1536 well plates at 2 (xl per 
well. Compounds dissolved in DMSO were then added (23 
nl per well) in a five point dilution series that ranged from 92 
nM to 57.5 |xM. All assay buffers contained 0.05% Tween- 
20 to reduce aggregation effects (38). After 15 min incu- 
bation at room temperature, detection reagents were added 
and HTRF activity was determined on a plate reader (En- 
Vision PerkinElmer, Boston, MA, USA). In the primary 
screen the fluorescence donor was terbium conjugated to 



anti-His 6 antibody that bound to MBNL1-A105-His 6 , and 
the fluorescence acceptor was XL665 conjugated to strep- 
tavidin that bound to biotin-(CUG)i2. The confirmatory 
assay employed a different detection system, consisting of 
AlphaScreen donor beads (conjugated to streptavidin) that 
emitted a singlet oxygen to activate AlphaScreen acceptor 
beads (coated with nickel, PerkinElmer). Both systems were 
shown to detect MBNL1-A105-His6 when in close proxim- 
ity to biotin-CUGn in solution (37). The assay was used 
to screen 279 433 compounds in the Molecular Libraries 
Small Molecule Repository (MLSMR, Supplementary Ta- 
ble SI). A description of the library and selection algorithm 
for the compounds can be found at http://mli.nih.gov/mli/ 
compound- repository/mlsmr- compounds/. Notably, the se- 
lection algorithm was not designed to enrich for RNA bind- 
ing compounds. 



Identification and structural analysis of dilomofungin 

Analysis of lomofungin (Enzo Life Sciences, BML-A245- 
0050) was performed on a Shimadzu LC2010 HPLC 
equipped with a CI 8 reverse-phase column. *H NMR spec- 
tra were recorded at 25° C on a Bruker Avance 400 (400 
MHz) or Bruker Avance 500 (500 MHz) instrument. Chem- 
ical shifts (8) are reported in parts per million (ppm) down- 
field from tetramethylsilane and referenced to the residual 
protium signal in the nuclear magnetic resonance (NMR) 
solvent (CDCI3, 8 = 7.26). Data are reported as chemical 
shift, multiplicity (s = singlet, d = doublet, t = triplet, m = 
multiplet), integration, and coupling constant (J) in Hertz 
(Hz). 13 C NMR spectra were likewise recorded at 25° C on 
a Bruker Avance 400 (100 MHz) or Bruker Avance 500 (125 
MHz). Chemical shifts (8) are reported in parts per million 
(ppm) downfield from tetramethylsilane and referenced to 
carbon resonances in the NMR solvent. High-resolution 
mass spectra were acquired at the University of Buffalo 
mass spectrometry facility, Buffalo, NY, USA. 



Fluorescence titration 

Fluorescence titrations were performed using a Varian Cary 
Eclipse spectrophotometer. A 50 |xM stock of compound 
in IX Hepes buffered saline with 0.5% DMSO and 0.005% 
tween-20 was titrated into 400 jxl of Cy3- labeled (CUG) 10 
RNA, also in the same buffer. After each addition the 
mixture was allowed to equilibrate for at least 10 min, 
or until no change in the fluorescence spectrum was ob- 
served. Changes in fluorescence emission at 565 nm (ex- 
citation at 550 nm) were measured. Raw data were cor- 
rected for dilution-dependent changes, plotted against com- 
pound concentration and fit to the one site binding equation 
(Equation 1): 

y = (^max X x)/(Kd + x) (1) 



Competition dialysis 

Synthetic oligonucleotides (Integrated DNA Technologies, 
Inc., Coralville, IA, USA) were amplified by polymerase 
chain reaction (PCR) as previously described (28,32). The 
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PCR products were used as templates for in vitro transcrip- 
tion (RNAMaxx kit, Stratagene). The transcripts were pu- 
rified on denaturing 10% polyacrylamide (19:1) gels. The 
RNA was visualized by UV shadowing, excised, eluted in 
0.3 M NaCl and precipitated in three volumes of ethanol 
overnight at — 80° C. The samples were resuspended in 10 
mM phosphate buffer (pH 7.2). The RNA concentration 
was determined by absorbance at 260 nm using the cor- 
responding extinction coefficients. Extinction coefficients 
were determined using HyTher version 1 .0 (39,40) based on 
nearest neighbor analysis (41). 

A fresh 3.27 mM stock solution of lomofungin was pre- 
pared in 500 |ul1 of DMSO and stored at -20°C. An aliquot 
of this stock solution was dissolved in 200 ml of phosphate 
buffer (8 mM NaH 2 P0 4 and Na 2 HP0 4 1 10 mM KC1, 2 mM 
MgCl 2 , 2 mM CaCl 2 and 1 mM Na 2 EDTA at pH 7.0). A 
similar procedure was used to prepare a 10 mM stock so- 
lution of dilomofungin. Slide- A-Lyzer MINI dialysis units 
(Pierce Biotechnology, Rockford, IL, USA) were used for 
competition dialysis assays. Each unit was evaluated before 
use by dialyzing against water for 24 h. The volume in each 
unit was measured, and units with volumes exceeding the 
original 200 |xl were discarded. 

Competition dialysis was performed as previously de- 
scribed (42). Briefly, dialysis units containing 0.1 ml of 1.56 
|xM full length RNA were placed into 200 ml of a 1 |xM 
ligand solution in phosphate buffer. The samples were al- 
lowed to equilibrate with the dialysate by stirring at 200 rpm 
for 24 h at room temperature (20-22° C). Previous exper- 
iments have shown that this interval is sufficient for sam- 
ples to reach equilibrium (42,43). For experiments to com- 
pare ligand binding to MBNL1 protein with RNA hairpins 
containing incremental numbers of 5 / CUG/3 / GUC motifs, 
1.33 |xM of each biomolecule was placed in the dialysis 
units. 

At the end of the equilibration period, 67.5 |xl of each 
sample was carefully removed from the dialysis unit and 
transferred to a microcentrifuge tube. To each sample, 7.5 
(xl of 10% (w/v) sodium dodecyl sulfate (SDS) was added, 
in order to dissociate the lomofungin or dilomofungin from 
nucleic acid or MBNL1. The SDS step was completed to 
ensure accurate determination of the concentration of the 
bound ligand, as the spectroscopic properties of bound lig- 
ands could differ from unbound ligands. 

The total ligand concentration (Q) within each dialy- 
sis unit was determined spectrophotometrically. Appropri- 
ate corrections were made for the small dilution resulting 
from the addition of the SDS stock solution. In parallel 
the free ligand concentration (Cf) was determined from an 
aliquot of the dialysate solution, which typically did not 
vary appreciable from the initial 1 jxM concentration. Ab- 
sorbance and fluorescence measurements were made using 
a UV- Visible spectrophotometer (Beckman Coulter, DU 
800). The bound ligand concentration (Q,) was then deter- 
mined by using Equation (2): 

C b = Q - C f (2) 

Pairwise signed fold changes were calculated from the 
bound ligand concentrations (Q,). P- values for differential 
ligand binding were determined by unpaired Mests, and a 



significance thresholds were false discovery rate (FDR) con- 
trolled for multiple testing. 

Lucif erase assay 

The flucO and fluc800 C2C12 lines (also known as Cl-S and 
C5-14) that express firefly luciferase fused to the DMPK 3' 
untranslated region with 0 or 800 CTG repeats were previ- 
ously described (23). Assays for luciferase activity were per- 
formed as previously described (26), 3 days after addition 
of vehicle or drug. 

Fluorescence in situ hybridization and immunofluorescence 

Fluorescence in situ hybridization (FISH) for CUG re- 
peat RNA and immunofluorescence for MBNL1 were per- 
formed as previously described (30) 3 days after addition of 
vehicle or drug. 

Quantitative and semi-quantitative RT-PCR 

TaqMan qRT-PCR analysis of flucO and fluc800 transcript 
levels was performed using a primer probe set that spanned 
a chimeric intron in the 5' UTR of the transgene. 

Probe: 5' /56-FAM/CACAATAACCAGCACGTTGCC 
CA/3IABkFQ/ 3' 

Primer 1: 5' GACTGACCGCGTTACTCC 3' 

Primer 2: 5' AGAATAGGAACTTCGGAATAGGAAC 

y 

The flue transcripts were quantitated using the AACt 
method with GAPDH (ABI, Cat# 4352339E) as endoge- 
nous control. Semi-quantitative RT-PCRs were also per- 
formed for Mapkapk5, Arid2, Nr3cl, Mapkapl and rRNA 
5' external transcribed spacer (ETS) using primers listed in 
Supplementary Table S6. RNA was extracted 3 days after 
addition of vehicle or drug. 

Conditional cell model for analysis of MBNL1 splicing regu- 
latory activity 

Plasmid pLC16 for conditional transcription of 800 CTG 
repeats was previously described (44). To establish a cell 
system for splicing analysis, we stably transfected C2C12 
mouse muscle cells with pLC16. PhiC31 integrase was co- 
expressed to produce single copy integrations. Transfection 
was performed using Nucleofector (Lonza) according to 
the manufacturer's program B-32. After stably transfected 
cells were selected in puromycin, transcription across the 
expanded repeat was activated by Cre-mediated excision of 
a transcription terminator cassette as previously described 
(44). For MBNL1 knockdown, cells were transfected with 
ON TARGET-plus MBNL1 -targeting siRNA (Dharma- 
con) using Lipofectamine RNAiMAX (Invitrogen). For an- 
tisense inhibition of CUG exp -MBNLl binding we nucleo- 
fected cells with 2 |xM of CAG25, a 25-mer morpholino 
oligonucleotide comprised of CAG repeats (30). For RT- 
PCR analysis of Sercal alternative splicing the RNA was 
harvested and cDNA was synthesized as described previ- 
ously (44). PCR amplification was carried out for 22 cycles 
using primers 5 - GCTC ATGGTCCTC A AG ATCTC AC- 3 ' 
and 5 / -GGGTCAGTGCCTCAGCTTTG-3 / , flanking the 
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alternative exon 22. PCR products were separated on 
agarose gels, stained with SYBR Green I (Invitrogen), 
scanned with a laser fluorimager (Typhoon, GE Health- 
care) and quantified using ImageQuant as previously de- 
scribed (16). 

RNA decay analysis 

Actinomycin D (Sigma) was added to the media of flucO (no 
repeat) or ffuc800 (800 CTG repeats) cells for up to 8 h in 
the presence or absence of 10 |xM of dilomofungin. Levels 
of transgene mRNA were determined by qRT-PCR as pre- 
viously described (44), normalized tol8S rRNA. 

RESULTS 

High-throughput screen 

A screen for inhibitors of MBNLl-(CUG)i2 binding was 
performed on 279 433 compounds in the MLSMR (Sup- 
plementary Table SI). The primary screen employed recom- 
binant MBNL1 and biotinylated (CUG)i 2 in a solution- 
phase HTRF assay (PubChem AID 2675). Each compound 
was tested in a five-point 1:5 dilution series ranging from 
92 nM to 57.5 |jlM, and concentration-response curve fit- 
ting was performed (37,45). The screen identified 140 com- 
pounds (0.05%) that showed > 60% inhibition at 57 |jlM, ex- 
cluding compounds having a biotin-like structure. Of these, 
135 were available as DMSO solutions and were selected 
for confirmatory and counter screens. These included (i) a 
repeat of the initial HTRF assay (Pubchem AID 493199); 
(ii) an orthogonal confirmatory assay of the same MBNL1- 
(CUG)i2 binding event using a different detection method 
(AlphaScreen assay, PubChem AID 493205); and (hi) a 
counter screen using an HTRF protein-peptide binding as- 
say (Pubchem AID 651843). Ten compounds showed con- 
sistent inhibitory activity in the confirmatory and orthog- 
onal assays but not in the counter assay, and were or- 
dered as powder samples for retesting in the AlphaScreen 
(Supplementary Figure SI). Among these, the most potent 
inhibitor was the antimicrobial agent lomofungin, which 
showed an IC50 more than 2-fold lower than other com- 
pounds in the screen (Figure 1). However, quality control 
analysis revealed the presence of two species in the lomo- 
fungin stock. 

Lomofungin dimerizes in DMSO, producing a stronger in- 
hibitor of MBNL1-CUG RNA binding 

HPLC analysis of lomofungin that was freshly prepared in 
DMSO showed a single peak with a retention time that 
was markedly different from a previously prepared solu- 
tion (Supplementary Figure S2A). Mass spectrometry anal- 
ysis of the unknown compound in the older solution in- 
dicated a mass of 626.2, two AMUs less than twice the 
molecular weight of lomofungin, suggesting dimerization 
with loss of two protons (Supplementary Figure S2B). In- 
frared and 1 H- and 13 C-NMR spectra of the unknown 
compound confirmed this expectation. The structure of 
the dimer, which we designated dilomofungin, featured a 
carbon-carbon bond between the same aromatic carbon on 
two lomofungin monomers (Figure 2A and B, and Supple - 
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Figure 1. HTRF (black squares) and AlphaScreen (open circles) assays 
showed inhibition of biotin-CUGi2 binding to MBNL1-A105-His6 pro- 
tein by lomofungin. Subsequent analyses indicated two molecular species 
in the lomofungin stock. Error bars represent 1 s.d. 
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Figure 2. ^-NMR indicates structure of dimerized lomofungin. (A) Lo- 
mofungin *H NMR (500 MHz, DMSO-d6) ppm: 11.05 (s, 1H), 8.13 (d, J 
= 7.7 Hz, 1H), 7.24 (d, J = 7.9 Hz, 1H), 6.80 (s, 1H), 3.92 (s, 3H). (B) Dilo- 
mofungin l H NMR (500 MHz, DMSO-d6) 8 11.20 (s, 2H), 8.20 (d, J = 
8.0 Hz, 2H), 7.33 (d, J= 8.0 Hz, 2H), 3.93 (s, 6H). Colored arrows indicate 
peaks for the corresponding hydrogens in the lomofungin and dilomofun- 
gin structures. 
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Figure 3. AlphaScreen assay showing that dilomofungin (open triangles) 
is 17-fold more potent as inhibitor of MBNLl-(CUG)i2 binding than lo- 
mofungin (closed squares) [IC 50 s = 42 nM, 95% CI 33 to 53 nM, and 717 
nM, 95% CI 533 to 964 nM, respectively]. 

mentary Figure S3). A limited examination of conditions 
under which lomofungin dimerizes in DMSO suggested 
that dimerization occurs at 4°C or above and proceeds more 
rapidly at higher temperatures and concentrations. For ex- 
ample, ^-NMR analysis of 10 mM lomofungin in deuter- 
ated DMSO solution showed that nearly all lomofungin was 
dimerized after 6 days at room temperature, and by 30 days 
the monomer was undetectable (Supplementary Figure S4). 

The affinities of lomofungin and dilomofungin for Cy3- 
(CUG)io RNA were determined by fluorescence titration 
(Supplementary Figure S5). The change in emission at 565 
nm was plotted against drug concentration, and then fit- 
ted to a one binding site equation. The calculated K^s 
were 998 ± 180 nM and 606 ± 300 nM for lomofungin 
and dilomofungin, respectively. To compare their efficiency 
for inhibiting MBNLl-(CUG)i2 binding, we repeated the 
AlphaScreen assay using the monomer or fully converted 
dimer. Dilomofungin exhibited 17-fold stronger inhibition 
than lomofungin, with IC 50 s of 42 nM (95% CI, 33-53 nM) 
and 717 nM (95% CI, 533-964 nM), respectively (Figure 3). 

RNA binding properties of lomofungin and dilomofungin 

We used competition dialysis to compare the relative bind- 
ing of lomofungin and dilomofungin to different targets. 
First, we examined binding to RNA hairpins containing 1- 
6 consecutive 5 / CUG/3 / GUC internal loops (Figure 4A) as 
compared to MBNL1-A105-His6 protein. Both lomofun- 
gin and dilomofungin showed a > 1.5-fold binding prefer- 
ence for RNA with a single internal 5 / CUG/3 / GUC loop, 
as compared to MBNL1 protein (P = 0.005 and 0.023, re- 
spectively). The binding preference for CUG repeats ver- 
sus MBNL1 protein was increased to > 4-fold for RNA 
with six internal 5 / CUG/3 / GUC loops (P = 0.0044 and 
0.0008, respectively) (Figure 4B and Supplementary Ta- 
bles S2 and S3). Notably, while lomofungin and dilomo- 
fungin both bound at 1 : 1 stoichiometry to RNA with a sin- 
gle 5 / CUG/3 / GUC internal loop, the increase in compound 
binding to RNAs containing additional 5 / CUG/3 / GUC 
motifs was not proportional to the number of repeats, sug- 



gesting negative cooperativity in binding to adjacent inter- 
nal loops. The RNA with six 5 / CUG/3 / GUC internal loops 
bound to lomofungin and dilomofungin with roughly 1:2 
(drug:internal loop) stoichiometry. 

Next, we used competition dialysis to examine lomofun- 
gin and dilomofungin binding to a panel of 10 RNA hair- 
pins that were identical except for a single mismatch in a 
5 / CNG/3 / GNC internal loop. Every possible mismatch ex- 
cept G-U and U-G wobble pairs was tested (Figure 4C). A 
hairpin with perfect complementarity in the stem ('no loop 
RNA) served as control for binding to dsRNA. Dilomo- 
fungin showed modest preference for binding to the U-U 
mismatch as compared to no loop RNA or other possible 
mismatches (P < 0.01) except A- A (Figure 4D and E, and 
Supplementary Tables S4 and S5). Binding characteristics 
of lomofungin were broader but still showed preferential 
binding to U-U and other pyrimidine mismatches, as com- 
pared to purine-purine or purine-pyrimidine mismatches. 

Dilomofungin reduces decay of CUG exp -containing tran- 
scripts 

Previously we found that a 25-mer morpholino antisense 
oligonucleotide comprised of CAG repeats ('CAG25') was 
able to compete with MBNL1 for binding to CUG repeat 
RNA, thus disrupting MBNLl-CUG exp complexes (30). In- 
jection of CAG25 into muscle of transgenic mice produced 
a local increase of nuclear export and translation of CUG exp 
mRNA, indicating that nuclear retention is overcome by 
masking the repeat sequence. To adapt this phenomenon 
for cell culture, we used C2C12 myogenic cells that were sta- 
bly transfected with a construct expressing firefly luciferase 
(flue). The flue cDNA was fused to a DMPK 3' UTR that 
contained 0 or 800 CTG repeats (flucO or fluc800 cells, di- 
agrammed in Figure 5A). As expected, the fluc800 cells ex- 
hibited nuclear foci of CUG exp RNA and MBNL1 protein 
(Figure 6), and nucleofection of CAG25 produced an 8- 
fold increase of luciferase activity, as compared to controls 
that were mock transfected or transfected with an irrele- 
vant morpholino (Figure 5B). In contrast, nucleofection of 
CAG25 had no effect on luciferase expression in flucO cells. 
FISH and immunofluorescence indicated that transfection 
of CAG25 did not fully eliminate the nuclear CUG exp foci 
in fluc800 cells, but did appear to increase the amount of 
MBNL1 staining in the nucleoplasm that was not associ- 
ated with foci (Supplementary Figure S6). These results are 
consistent with previous in vivo observations that CAG25 
injection produced partial release of CUG exp RNA and 
MBNL1 protein from nuclear foci, resulting in increased 
MBNL1 activity and increased nuclear export and trans- 
lation of CUG exp mRNA (30). Notably, CAG25 produced 
a modest reduction of /?wc-CUG exp transcripts in both sys- 
tems ((30) and data not shown), suggesting that increased 
nuclear export may also enhance the turnover of CUG exp 
RNA. 

Next we used fluc800 cells to examine the effects of 
lomofungin and dilomofungin on translation of the fluc- 
CUG exp mRNA. Three days after addition to the culture 
media, dilomofungin produced a dose-dependent increase 
of luciferase activity, reaching 8.1-fold (95% CI ± 0.7) up- 
regulation at 10 |xM (normalized to cells treated with vehi- 
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Figure 4. Competition dialysis indicated that lomofungin and dilomofungin bind to S'CUG/S'GUC internal loops in preference to MBNL1 protein 
or perfectly duplexed RNA. (A) Diagram of hairpin RNAs that contained increasing numbers of 5 f C\JG/3'G\JC internal loops. (B) The amount of 
lomofungin or dilomofungin bound to MBNL1 protein or the indicated RNA during competition dialysis. * indicates ^adjusted < 0-05 for comparison 
of indicated RNA versus MBNL1 binding. (C) Diagram of RNA hairpins with single nucleotide internal loops used for competition dialysis. (D, E) The 
amount of lomofungin (D) or dilomofungin (E) bound to RNAs displaying the indicated single nucleotide internal loops or 'no loop RNA. * denotes 
differential binding of compound to the indicated RNA hairpin as compared to 'no loop RNA, Adjusted < 0-05. All error bars represent 1 s.d. 
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Figure 5. Lomofungin and dilomofungin increased luciferase activity in 
cells that express firefly luciferase fused to the DMPK 3 f UTR containing 
800 CUG repeats. (A) Diagram of the luciferase assay. The stably trans- 
fected C2C12 cells expressed firefly luciferase (flue) with 800 or 0 CUG 
repeats in the 3 f UTR (designated fluc800 or flucO cells, expressing fluc- 
CUG exp or //wc-CUG 0 mRNAs, respectively). Mbnll protein (green cir- 
cles) is sequestered by fluc-C\JG exv mRNA in nuclear foci. The transla- 
tion of fluc-CUG exv mRNA is reduced by nuclear retention. When pro- 
tein interactions of CUG repeats are blocked, the nucleocytoplasmic trans- 
port and translation of fluc-CUG exp mRNA is increased. (B) Validation 
of assay using CAG25 antisense morpholino, a known binder of CUG re- 
peats. Two days after transfection of CAG25, luciferase was upregulated 
in fluc800 but not flucO cells. Luciferase activity was normalized to mock 
transfected controls. Luciferase was not affected by control antisense mor- 
pholino directed against an irrelevant target (exon 362 of Titin). (C) Lu- 
ciferase activity in flucO and fluc800 cells treated with lomofungin or dilo- 
mofungin for 3 days, normalized to cells treated with vehicle alone (1% 
DMSO). (D) Luciferase activity in lomofungin- or dilomofungin-treated 
cells, expressed as a ratio of fluc800 to flucO cells, and normalized to the 
ratio in vehicle-treated cells (same experiments as 'C'). Error bars indicate 
1 s.d., and * denotes P < 0.05. 
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Figure 6. FISH combined with immunofluorescence showed increased nu- 
clear foci of CUG exp RNA in fluc800 cells treated with dilomofungin, and 
increased diffuse nucleoplasmic MBNL1 in cells treated with lomofungin. 
(A, D and G) FISH for CUG exp RNA (red). (B, E and H) Immunofluores- 
cence for MBNL1 protein (green). (C, F, I) Merged images. Dapi shows nu- 
clei in each panel (blue). FISH and immunofluorescence signals are thresh- 
olded and displayed with identical settings for each column. Insets in C, F 
and I represent 3 x magnification of the indicated region. 



cle alone, Figure 5C). In contrast, dilomofungin did not af- 
fect luciferase in flucO cells, and lomofungin did not increase 
luciferase in fluc800 cells. However, lomofungin showed cy- 
totoxicity and luciferase inhibition at this concentration 
(Figure 5C and Supplementary Figure S7), consistent with 
previous observations that at high concentration it inhibits 
transcription and translation in yeast (46). When normal- 
ized to results in flucO cells, lomofungin also produced a 
modest increase of luciferase in fluc800 cells (3.7-fold at 10 
IxM, 95% CI =b 0.4, Figure 5D). 

These observations raised the possibility that dilomo- 
fungin, like CAG25 morpholino, can promote the nuclear 
export and translation of fluc-CUG exp transcripts. Alter- 
natively, it was possible that the increase of luciferase re- 
sulted from greater overall accumulation of the fluc-CUG Qxp 
mRNA, without affecting nuclear export efficiency. To dis- 
tinguish these possibilities, we used FISH to examine nu- 
clear levels of fluc-CUG Qxp mRNA. Three days after addi- 
tion to the culture media, dilomofungin (10 |xM) but not 
vehicle (DMSO) produced a striking increase in the num- 
ber and intensity of nuclear foci in fluc800 cells (Figure 
6G), suggesting a major build-up of fluc-CUG Qxp mRNA. 
Quantitative RT-PCR confirmed a 4-fold increase of flue 
mRNA in dilomofungin-treated ffuc800 cells (Figure 7A). 
By contrast, lomofungin did not affect CUG exp foci (Fig- 
ure 6D). To assess specificity, we used quantitative or 
semi-quantitative RT-PCR to examine housekeeping genes 
Polr2a, 18S rRNA, and four endogenous mRNAs that con- 
tained short runs of CNG repeats (Mapkapl, Mapkapk5, 
Arid2 and Nr3cl, containing 25 CUG, 10 CCG, 11 CCG 
and 17 CAG repeats, respectively) (Figure 7B and Supple- 
mentary Figure S8). None of these transcripts showed a no- 
ticeable increase in response to dilomofungin, which sug- 
gested that the transcript accumulation induced by dilomo- 
fungin was relatively selective for fluc-CUG Qxp mRNA. 

To address the mechanism for CUG exp accumulation, we 
assessed the decay of flue mRNA. The transcription in- 
hibitor actinomycin D (5 jxg/ml) was applied to fluc800 
or flucO cells, in the presence or absence of dilomofungin. 
The rate of fluc800 mRNA decay was markedly reduced by 
dilomofungin (10 |xM), producing a 5-fold increase in the 
calculated half-life (Figure 7C). In contrast, dilomofungin 
had no effect on stability of flucO mRNA, or on levels of 
the 5' ETS of rRNA (Supplementary Figure S7), an rRNA 
fragment that is generated and rapidly degraded in the nu- 
cleus (47). These results suggested that stabilization of fluc- 
CUG exp mRNA by dilomofungin did not result from global 
inhibition of nuclear or cytoplasmic RNA decay. 

Lomofungin improves MBNL1 splicing regulatory activity in 
CUG exp -expressing cells 

Addition of lomofungin (10 |xM) to the culture media led 
to increased diffuse nucleoplasmic staining for MBNL1 
in fluc800 cells, consistent with partial alleviation of the 
MBNL1 sequestration (Figure 6C, F). Interestingly, de- 
spite the increase of CUG exp RNA, the immunofluores- 
cence also appeared to show that some MBNL1 protein 
was not associated with RNA foci in dilomofungin-treated 
cells (Figure 61). To test for effects on MBNL1 splicing reg- 
ulatory activity, we examined sarco / endoplasmic reticulum 



6598 Nucleic Acids Research, 2014, Vol. 42, No. 10 



_0J 

< 

DC 

E 
c 

<v 

M 

c 




B 



DMSO 



IflucmRNA 
H8SrRNA 
I POLR2A mRNA 



Dilomofungin 



Treatment 




flucO 
+DMSO 



100 
80 
60 
40 
20 
0 



flucO 




flucO fluc800 fluc800 

+Dilomo +DMSO +Dilomo 
Clone and Treatment 



fluc800 




2 4 6 
Time(h) 



2 4 6 
Time (h) 



Figure 7. Dilomofungin stabilizes fluc-CUG exv , but not flue- CUG 0 
mRNA. (A) Levels of flue mRNA in flucO and fluc800 cells treated with 10 
|xM dilomofungin or vehicle (DMSO) for 3 days were determined by qRT- 
PCR. Results were normalized to Gapdh and expressed as the fold-change 
relative to vehicle alone. Note that Gapdh Q values did not vary with 
dilomofungin treatment. (B) Expression levels of flue, RNA polymerase II 
polypeptide A (Polr2a) and 18SrKNA in flucO and fluc800 cells treated with 
10 |jlM dilomofungin or vehicle alone for 3 days. Expression levels were 
determined by qRT-PCR normalized to Gapdh. (C) Decay of flue mRNA 
after addition of actinomycin D is shown for dilomofungin (10 |jlM) or ve- 
hicle (DMSO) treated flucO and fluc800 cells. Flue mRNA levels were de- 
termined by qRT-PCR and normalized to 18S rRNA. The 0 h time point 
was set to 100% expression. Error bars indicate 1 s.d., and * denotes P < 
0.05. 



Ca 2+ -ATPase 1 (Sercal). The alternative splicing of Sercal 
exon 22 {Sercal ex22) is highly conserved and strongly pro- 
moted by MBNL1 (48). Among numerous exons regu- 
lated by MBNL1, Sercalzxll is perhaps the most sensi- 
tive and specific known indicator of MBNL1 activity, be- 
cause it shows the largest effect in DM1 that is conserved in 
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Figure 8. Partial rescue of Sercal alternative splicing by CAG25, lomo- 
fungin and dilomofungin in C2C12 cells that were stably transfected with 
a construct for conditional expression of CUG exp RNA. Expression of 
(CUG)8oo was triggered by Cre excision of a transcription terminator cas- 
sette. The upper panel shows representative gel images of RT-PCR as- 
say for inclusion of Sercal exon 22. The lower panel indicates the com- 
bined results of three experiments. Exon 22 inclusion was reduced by 
siRNA knockdown of MBNL1 (siMBNLl) or Cre-mediated activation 
of (CUG)goo expression. Treatment of the (CUG)goo expressing cells with 
10 |xM lomofungin or dilomofungin, or by transfection of CAG25 mor- 
pholino, caused partial rescue of exon 22 splicing (P = 0.001, 0.031 and 
0.007, respectively, relative to vehicle treated cells after Cre). Error bars 
indicate 1 s.d. 



MBNL1 knockout mice and reversible by CUG exp knock- 
down (16,18,31). To clearly delineate the effects of CUG exp 
RNA on splicing regulation, we developed a cell model for 
conditional expression of expanded CUG repeats, again us- 
ing C2C12 murine myogenic cells. C2C12 cells were stably 
transfected with a construct, pLC16, for conditional expres- 
sion of expanded CUG repeats (44). As the cells were ini- 
tially derived, the transcription of the repeat was blocked by 
a floxed transcription terminator element (TTE). The TTE 
was located upstream of a hygromycin selectable marker 
that was fused to the DMPK 3' UTR containing 800 CUG 
repeats. Prior to recombination the cells were hygromycin 
sensitive, did not exhibit CUG exp foci and showed 90 ± 
2% inclusion of Sercalexll (assessed by RT-PCR, Figure 
8), equivalent to non-transfected cells. In non-recombined 
cells the transfection of Mbnll siRNA reduced the inclu- 
sion of Sercal ex22 to 38 ± 1%, confirming that Sercal qx22 
is MBNL1 regulable in C2C12 cells (48). Transfection of 
plasmid encoding Cre recombinase followed by selection in 
hygromycin produced a population of cells that expressed 
CUG exp RNA and displayed nuclear foci (not shown). In 
recombined cells the level of Sercal ex22 inclusion was re- 
duced to 54 ± 3% (Figure 8), consistent with MBNL1 se- 
questration. Transfection of CAG25 increased the inclusion 
frequency back to 77 ± 4% (P = 0.001), consistent with par- 
tial release of sequestered MBNLl. Application of 10 |xM 
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lomofungin also promoted the inclusion of Sercalexll (71 
± 3%, P = 0.031, Figure 8). By comparison, dilomofungin 
caused a smaller but still significant increase of SercalexH 
inclusion (60 ± 2%, P = 0.007, see the Discussion). 

DISCUSSION 

Targeting of RNA with small molecules has been advocated 
as a therapeutic approach, especially when viable protein 
targets are lacking [reviewed by Tor (49) and Thomas and 
Hergenrother (50)]. The utility of RNA targeting is best il- 
lustrated by numerous antibiotic drugs that inhibit bacte- 
rial protein synthesis. Among well-studied examples, it is 
remarkable that these drugs uniformly bind to prokaryotic 
rRNA rather than ribosomal proteins [reviewed by Wilson 
(51)]. Furthermore, while the target engagement, effective- 
ness and tolerability of antibiotics is well established, as a 
class it is noteworthy that the affinity and selectivity of an- 
tibiotics for binding to bacterial rRNA, as opposed to off- 
target sequences in the host transcriptome, is quite modest. 
These and other observations have fuelled interest in devel- 
oping novel compounds that target cellular or viral messen- 
ger or noncoding RNAs. However, the feasibility of target- 
ing RNAs that are much less abundant and less conforma- 
tionally constrained than rRNA remains uncertain, and the 
optimal methods for selecting targets and screening ligands 
have not been determined. 

In this context, DM1 presents an attractive paradigm for 
RNA targeting. First, the biologic rationale is solid because 
the mutant DMPKmRNA is directly pathogenic, due to the 
presence of an expanded repeat (52). Second, the CUG exp 
tract forms a distinctive secondary structure, described as 
a 'slippery' hairpin (53-55). 'Slippery' denotes a tendency 
for U-U mismatches to promote 'breathing' or realignment 
of strands in the hairpin stem. The long r(CUG) n hairpins 
are stable in vitro [T m ~75°C (54)] and presumably also in 
cells. Third, the mean CTG expansion size in muscle tissue 
of DM1 patients is 4400 repeats (18) and the repeat tract 
is fully transcribed (13,14). Thus, while the DMPK tran- 
scripts are not highly abundant [~1 per 10 000 mRNAs in 
muscle (56)], each repeat tract in mutant mRNA presents 
a huge capacity for protein or ligand binding. Based on 
observed packing density of one MBNL1 protein per four 
CUG repeats (57) we estimate that, on average, each mu- 
tant transcript may present ~1000 MBNL1 binding sites. 
Fourth, the mutant transcripts are concentrated in one or a 
few foci in muscle nuclei, occupying a tiny fraction of the nu- 
clear volume. In terms of poly-(CUG) binding proteins, the 
combination of high multivalency and localization in foci is 
likely to drive strong sequestration and functional inactiva- 
tion of MBNL1 (15,16,18,58). In terms of small molecules, 
the same circumstance may drive high levels of ligand bind- 
ing (59,60). Such an effect may explain why lomofungin and 
dilomofungin, without further chemical optimization, are 
exhibiting bioactivity in CUG exp -expressing cells, despite 
target affinities and selectivities that are not highly robust. 
Fifth, recent evidence suggests that many splicing defects 
and symptoms of DM1 are triggered when the last remain- 
ing fraction of MBNL protein is titrated from the nucle- 
oplasm and sequestered in foci (18,61), which implies that 
partial release of MBNL protein may be all that is required 



to improve splicing regulation. And sixth, experience with 
the CAG25 morpholino, whether by local or systemic ad- 
ministration in transgenic mice, has provided proof of con- 
cept that inhibitors of MBNL-CUG exp binding can produce 
rapid correction of splicing defects and phenotypes in vivo 
(30,62). Since the intracellular delivery of CAG25 remains 
challenging, the rationale to develop small molecules having 
similar effects is strong. 

Previous studies have used unbiased screens to iden- 
tify compounds with anti-DMl activity. A pilot screen (13 
000 compounds) employed a cell-based assay to identify 
molecules that modulate a single DM1 -affected splice event 
(63). The reproducibility of the assay was low (30%) and the 
hits were not pursued. Ketley et al. used imaging to perform 
a cell-based screen (16 000 compounds) for molecules that 
inhibited the formation of nuclear CUG exp foci (64). The ac- 
tive compounds showed a reproducible reduction of signal 
intensity in foci, but, in contrast to CAG25, the nuclear ex- 
port of CUG exp RNA was not enhanced. These compounds 
are valuable tools for investigating the still unknown mech- 
anism for localization of CUG exp transcripts in foci. 

In contrast, our study is the first biochemical screen 
for inhibitors of a deleterious RNA-protein interaction in 
DM1 or other tandem repeat expansion diseases. In prin- 
ciple, a screening effort to inhibit RNA:protein recogni- 
tion may have the same conceptual limitations as protein- 
protein binding, namely, the challenge of finding a low 
molecular weight ligand that blocks a macromolecular in- 
teraction involving a large binding interface. However, pre- 
vious screens for protein: RNA binding inhibitors, and par- 
ticularly those involving structured RNA, have shown some 
success, at least in terms of finding bona fide inhibitors of 
in vitro binding (65,66). The current study and our sepa- 
rate publication reporting activities for two compounds that 
showed lower activity in the initial screen [compounds C 
and I in Supplementary Figure SI (67)] support the feasi- 
bility of using high-throughput methods to find ligands that 
inhibit protein recognition of repetitive RNA. It remains to 
be determined whether the number of hits and activity of 
the candidates can be improved by using larger screens or 
custom libraries that are specifically designed for targeting 
nucleic acids. 

In our initial characterization, we found that lomofun- 
gin exhibits many of the characteristics that would be de- 
sired for a small molecule modulator of RNA toxicity. This 
natural product binds to 5 / CNG/3 / GNC internal loops 
with a pyrimidine mismatch in vitro and inhibits the bind- 
ing of (CUG)i2 to MBNL1. In CUG exp -expressing cells it 
increases the diffuse nucleoplasmic localization and splic- 
ing regulatory activity of MBNL1. Further work is needed 
to determine whether it acts as an intercalator or groove 
binder, and whether it is possible to enhance its activity and 
reduce its toxicity by chemical modification or alternative 
modes of multimerization. Although lomofungin was iso- 
lated and characterized as a natural antimicrobial agent in 
the 1960s, and shown to impact transcription and transla- 
tion in yeast at high concentration, its precise intracellular 
targets were never determined, and it is unclear how the 
RNA binding properties we have uncovered are related to 
the previous work. 
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It is noteworthy that phenolic compounds such as lomo- 
fungin may have tendencies to produce non-specific aggre- 
gation and false positive results in high-throughput screens 

(68) . However, aggregation effects are unlikely to account 
for the lomofungin activity in our screening assays. (1) The 
MLSMR library was previously screened for aggregation- 
induced inhibitors and lomofungin tested negative (Pub- 
Chem AID 485341) (38). (2) Detergent (0.05% Treen-20) 
was included in our assays to reduce aggregation effects 

(69) . (3) If aggregation was the mechanism, then inhibi- 
tion of protein-peptide interaction would be expected in the 
counter screen. This was not observed. (4) Out of 566 assays 
in PubChem that included lomofungin, only four showed 
activity with sub-micromolar potency. Among these, the 
potency of lomofungin in our screen was greater than any 
previous assay. These data, along with the bioactivity in 
CUG exp -expressing cells, argue against non-specific signal 
quenching in our screening assays. 

At the time when dimerization was recognized it was no 
longer possible for us to determine the exact ratio of lo- 
mofungin monomer and dimer in the stock that was used 
in the primary screen. However, comparisons across assays 
suggest that material in the primary screen was a mixture 
of both (IC50 value of 29 1 nM in the primary AlphaScreen 
versus 717 nM and 42 nM in subsequent assays for pure 
monomer and fully converted dimer, respectively). Regard- 
less, the activity of monomer was sufficient to secure a place 
for lomofungin among the top 10 hits in the screen, and it 
appears therefore that discovery of a dimer with greater in- 
hibitory activity was largely accidental. This difference be- 
tween binding inhibition by monomer and dimer is perfectly 
aligned with previous findings that assembly of r(CUG) n - 
binding monomers into dimers or higher order multimers 
produced major improvements of (CUG) n affinity, selec- 
tivity and MBNL1 binding inhibition (23,26,28,33,34,70). 
However, in previous studies the monomers were coupled 
using an intersubunit spacer, and empiric optimization of 
the spacer length and composition in some cases produced a 
> 100-fold improvement of the inhibition potency. By com- 
parison, the monomers in dilomofungin are directly cou- 
pled, producing a 17-fold improvement of potency, and re- 
sulting in an IC50 that was 8 -fold lower than previously re- 
ported dimers. It will be interesting to determine whether 
other coupling arrangements of lomofungin monomers can 
further modulate its inhibitory properties. 

Dilomofungin was more potent than other compounds 
in the screening set yet it exhibited troubling cellular effects. 
Our observation that a strong inhibitor caused marked cel- 
lular accumulation of CUG exp RNA was unexpected. We 
are not aware of previous instances in which screens for 
modulators of RNA-protein binding have identified com- 
pounds with dramatic effects on RNA turnover. The mech- 
anism for target stabilization in CUG exp -expressing cells is 
unknown. Conceivably, it may reflect an unintended con- 
sequence of the binding inhibition. Using transcriptome- 
wide analyses, previous studies have shown that knock- 
down of MBNL1 led to increased expression of mRNAs 
that have MBNL1 binding sites in the 3' UTR, suggesting 
that MBNL1 binding has a net destabilizing effect (19,71). 
When averaged across the entire spectrum of MBNL1 tar- 
gets the effect was small (<5% change of the mean expres- 



sion level) (19). However, certain endogenous transcripts 
with multiple MBNL1 binding sites in the 3' UTR showed 
a 2-fold increase of mRNA half-life after MBNL1 knock- 
down (71). For transcripts with expanded CUG repeats and 
many binding sites in the 3' UTR, it seems possible that ef- 
fects of MBNL1 on mRNA decay are more pronounced. 
If correct, the stabilization of fluc-CUG Qxp by dilomofun- 
gin may simply reflect highly effective inhibition of MBNL1 
binding. By this 'diminishing returns' model, the process 
of optimizing ligands for binding inhibition may inevitably 
lead at some point to transcript accumulation and reduced 
efficacy, which is essentially what we observed with the 
dimerization of lomofungin. However, it is unclear whether 
the previous observations of MBNL1 effects on mRNA 
turnover in the cytoplasm would apply to nuclear retained 
transcripts, where the pathways for mRNA degradation are 
less defined. Also, we observed that CAG25 morpholino 
has the opposite effect of reducing levels of CUG exp ac- 
cumulation (30), though it is possible that antisense bind- 
ing may have distinct effects on metabolic fate. An alter- 
native possibility is that target stabilization may prove to 
be a unique property of dilomofungin, related to specific 
effects on exonuclease processivity or other aspects of nu- 
clear decay, that are avoidable for other inhibitors with sim- 
ilar potency. Whatever mechanisms may apply, our results 
suggest that future efforts to develop binding inhibitors for 
toxic RNA, whether by chemical screens or structural pre- 
dictions, should assess the impact on RNA turnover at an 
early stage. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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